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(54) Title: HEADTRACKED PROCESSING FOR HEADTRACKED PLAYBACK OF AUDIO SIGNALS 



(57) Abstract 

A method of simulating a spa- 
tial sound environment to a listener 
over headphones is disclosed com- 
prising inputting a series of sound sig- 
nals having spatial components; de- 
termining a current orientation of the 
headphones around the listener, de- 
termining a mapping function from a 
series of spatially static virtual speak- 
ers placed around the listener to each 
ear of the listener, utilising the cur- 
rent orientation to determine a cur- 
rent panning of the sound signals to 
the series of virtual speakers so as 
to produce a panned sound input sig- 
nal for each of the virtual speakers; 
utilising the mapping function to map 
the panned sound input signal to each 
ear of the listener, and combining the 
mapped panned sound input signals 
to produce a left and right output sig- 
nal for the headphones. 
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Head-tracked Processing for Headtracked Playback of Audio 



Field of the Invention 

The present invention relates to the creation of 
spatialized sounds utilizing a headtracked set of 
headphones . 



Methods for localizing sounds utilizing headphones and 
a headtracking unit are known. For example, in US Patent 
Serial No. 08/723,614 entitled "Methods and Apparatus for 
Processing Spatialized Audio", there is disclosed a system 
for virtual localization of a sound field around a listener 
utilizing a pair of headphones and a headtracking unit 
which determines the orientation of the headphones relative 
to an external environment. Unfortunately, the disclosed 
arrangement requires a high computational power or resource 
for real time rotation of a sound field environment so as 
to take into account any headphone movement relative to the 
desired sound field output. 

Alternatively, without headtracking, a virtual speaker 
system over headphones can be simulated by using a pair of 
filters for each virtual sound source and then a post 
mixing of the results to produce left and right signals. 
For example, turning initially to Fig. 1, if it is desired 
to simulate to a user 1 over headphones eg. 2, 3a virtual 
sound environment, with, for example, the environment 
comprising the popular Dolby DIGITAL (Trade Mark) 
environment which includes a left, 5, and right, 6 sound 
source in addition to a center cell source 7 and back left 
and right sound sources 8 and 9, then one form of suitable 
arrangement may be as illustrated 10 in Fig. 2. The 
arrangement 10 includes, for each channel eg. 11 providing 
a head related transfer function filter eg. 12, 13 for each 
input channel which maps the sound source to each of the 
left and right ears so as to form left and right headphone 
channels 16, 17. Similarly, each of the other channels is 
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similarly processed and the output summed to each head 
channel. The arrangement 10 in Fig- 2 is provided for a 
system that does not utilize headtracking . The arrangement 
of Fig. 2 requires significant length filters eg. 12, 13 
for each channel. Of course, many filter optimisations are 
possible in respect of the non treadtracked arrangement. 
An example of these optimisations include those disclosed 
in PCT Patent Application No. PCT AU99/00002 filed 6 
January, 1999 by the present applicant entitled "Audio 
Signal Processing Method and Apparatus". 

One possible method utilized by others to perform 
headtracking is to use an enormous amount of computational 
memory for storing a large number of sets of filter 
coefficients. For example, a set of filter coefficients 
could be stored for every angle around a listener (for full 
360 coverage) , then, each time the listener rotated their 
head the filter coefficients could be updated to reflect 
the new angle. A cross fade to the new filter coefficients 
would remove any unwanted artefacts. This technique has 
the significant disadvantage that it requires an enormous 
amount of memory to store the large number of filtered 
coefficients . 

An alternative technique is disclosed in US Patent No. 
5,659,619 by Abel which utilizes a process of principle 
component analysis where the head related transfer function 
is assumed to consist of several individual filter 
structures which are all modified from a look-up table 
according to a current head angle. This method provides 
for a reduction in memory requirements. 

However, it is only practical for short filters (short 
HRTF length) which provide for directionality of a sound 
source and it is not practical for a full room reverberant 
response in addition to the effective simulation of a full 
room 

It would be desirable to provide for a more efficient 
form of simulation of a sound surround environment over 
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headtracked headphones in addition to the effective 
simulation of a full room reverberant response. 
Summary of the Invention 

It is an object of the present invention to provide 
for a more efficient form of simulation of a surround sound 
environment over headtracked headphones. 

In accordance with a first aspect of the present 
invention, there is provided a method of simulating a 
spatial sound environment to a listener oyer headphones 
comprising inputting a series of sound signals having 
spatial components; determining a current orientation of 
the headphones around the listener; determining a mapping* 
function from a series of spatially static virtual speakers 
placed around the listener to each ear of the listener; 
utilising the current orientation to determine a current 
panning of the sound signals to the series of virtual 
speakers so as to produce a panned sound input signal for 
each of the virtual speakers; utilising the mapping 
function to map the panned sound input signal to each ear 
of the listener; and combining the mapped panned sound 
input signals to produce a left and right output signal for 
the headphones. 

Preferably, the virtual speakers include a set of 
simulated speakers placed at substantially equal angles 
around the listener which can be placed substantially in a 
horizontal plane around a listener or placed so as to fully 
surround a listener in three dimensions. The present 
invention has particular application wherein the series of 
sound signals comprise a Dolby DIGITAL encoding of a sound 
environment . 

In accordance with a second aspect of the present 
invention, there is provided an apparatus for simulating a 
spatial sound environment to a listener over headphones 
comprising input means for inputting a series of signals 
comprising a spatial -sound environment; panning means for 
panning the series of signals amongst a predetermined 
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number of virtual output signals to produce a plurality of 
virtual output speakers signals; head related transfer 
function mapping means for mapping the virtual output 
speaker signals to left and right headphone channel 
signals; and combining means for combining each of the 
left and right headphone channel signals into combined left 
and right headphone signals for playback over the 
headphones . 

Preferably, the panning means, the head related 
transfer function mapping means and the combining means are 
implemented in the form of a suitably programmed digital 
signal processor. 

Brief Description of the Drawings 

Notwithstanding any other forms which may fall within 
the scope of the present invention, preferred forms of the 
invention will now be described, by way of example only, 
with reference to the accompanying drawings in which: 

Fig. 1 illustrates the concept of a surround sound 
system; 

Fig. 2 illustrates a prior art arrangement for 
creating a surround sound environment over headphones; 

Fig. 3 illustrates the utilization of a virtual 
speaker system in accordance with the preferred embodiment; 

Fig. 4 is a schematic block diagram of the structure 
of the preferred embodiment; 

Figs. 5 and 6 illustrate the extension of the 
preferred embodiment to three dimensions; and 

Fig. 7 illustrates one form of implementation of the 
preferred embodiment. 

Description of Preferred and Other Embodiments 

In the preferred embodiment, a fixed filter and 
coefficient structure is utilized to simulate a stationary 
virtual speaker array and then a speaker panner is utilized 
to position the virtual sound sources at desired positions. 
The preferred embodiment will be discussed with reference 
to a Surround Sound implementation of the popular Dolby 
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DIGITAL format. 

Turning to Fig. 3, there is illustrated a method of 
the preferred embodiment 5 . The method of the preferred 
embodiment comprises utilizing a set of virtual speakers 
21-26 arranged around a listener 27. A head related 
transfer function to each ear of the listener 27 is 
calculated for each of the virtual speakers 21-26 arranged 
around a listener 27. The techniques utilized can be 
substantially the same as those described previously with 
reference to Fig. 2 and known in the prior art. 

A series of virtual surround sound speakers 31-35 are 
then utilized having a stable external reference frame 
relative to the user 27. Hence, as the user 27 turns their 
head, the virtual speaker 32 for example is panned between 
speakers 21-22 so as to locate the speaker 32 at the 
requisite point between speakers 21 and 22. Similar 
panning occurs for each of the other virtual surround sound 
speakers 32-35. Hence, each of the surround sound 
channel sources eg. 32 is panned between speakers so as to 
provide for the directionality of each sound source. The 
directionality of each sound source can be updated 
depending on the rotation of a listener's head and the 
speaker panning technique can be totally flexible and 
compatible with prior art panning techniques for 
conventional loudspeakers . 

Turning now to Fig. 4, there is illustrated one form 
of arrangement of the preferred embodiment 40. The 
preferred embodiment is based around two parts including a 
speaker panning section 41 and HRTF section 42. The HRTF 
section 42 includes the usual series of filters eg. 43, 44 
which map each of the virtual speakers 21-26 to the left 
and right ear of the listener 27. The filter coefficients 
being substantially static. 

The input channels for each of the surround sound 
sources 31-35. are input tQ 1 input to M output speaker 
panner 46. The speaker panner 4 6 also having as an input 
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47 the headtracking input signal from a listener's 
headphone. The speaker panner 4 6 can then be set to 
provide panning between the virtual output speakers 21-26 
which are output eg. 49. 

The technique of the preferred embodiment can be 
extended to provide for headtracking of elevation and roll 
of a user's head position where such information is 
available from the headtracking unit. This can be achieved 
by extending the location of the stationary virtual 
speakers to be in a three-dimensional cube around a 
listener. For example, if eight virtual speakers are 
simulated representing the eight corners of a cube around a 
listener, then any panning system can also compensate for 
head movements around a Y and Z plane. Hence, in addition 
to yaw, elevation and roll can also be taken into account. 
Of course, the more virtual speakers utilized to create the 
virtual speaker space around a listener, the better the 
accuracy of the system. Once again, panning can be 
provided by means of a front end system that utilizes the 
headtracked yaw, elevation and roll position to determine 
the panning effect between speakers. For example, as 
illustrated in Fig. 5, the elevation of a listener 55 can 
be determined via a standard headtracking unit and utilized 
to pan three-dimensional sound sources 56-59 around 
speakers 50-53 in accordance with the requirements. 
Similarly, as illustrated in Fig. 6, the roll of a user's 
head 55 can be utilized for panning the virtual sound 
sources 66-69 between virtual speakers 61-64 again as a 
pre-processing step. 

Turning now to Fig. 7, there is illustrated an example 
system 70 for implementation of the preferred embodiment. 
The system 70 includes a standard DVD digital input source 
71 which is fed to an DIGITAL decoder 72 which again can be 
standard. The DIGITAL decoder outputs center channel 73, 
front left and right channels 74, and surround or back left 
and right channels 75. The outputs 7 3-75 are fed to a DSP 
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processing board 7 6 which operates with an attached memory 
77. One form of suitable DSP processing board can be the 
Motorola 56002 EVM evaluation board card designed to be 
inserted into a PC type computer and directly programmed 
therefrom and having suitable Analogue/Digital and 
Digital/Analogue converters . 

A set of headphones 7 9 are provided which include 
headtracking capabilities in the form of an angular 
position circuit 80. The angular position circuit 80 
determines the yaw, elevation and roll and can comprise a 
Polhemus 3 space Insidetrak Tracking system available from 
Polhemus, 1 Hercules Drive, PO Box 560, Colchester, VT 
05446, USA. The output from the angular position circuit 
80 is converted to a digital form 81 for inputting to DSP 
chip 76. The DSP chip 7 6 is responsible for implementing 
the core functionality of Fig. 4, outputting two digital 
channels to digital to analogue converter 82 which in turn 
outputs analogue left and the right headphone speaker 
channel data which can be amplified 83, 84 in accordance 
with the requirements. The DSP chip 7 6 also implements the 
speaker panner mixing which pans the input sources 73-75 
according to the input angular position. Further, a filter 
array is provided within the DSP 76 which simulates the 
virtual speaker array of six speakers in accordance with 
the previously known prior art techniques. 

It would be therefore evident that the preferred 
embodiment provides for a simplified form of providing for 
full surround sound capabilities of the headtracked 
headphones in the presence of movement of the listener's 
head. 

It would be appreciated by a person skilled in the art 
that numerous variations and/or modifications may be made 
to the present invention as shown in the specific 
embodiment without departing from the spirit or scope of 
the invention as broadly described. The present embodiment 
is, therefore, to be considered in all respects to be 
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We Claim : 

1. A method of simulating a spatial sound 
environment to a listener over headphones comprising: 
inputting a series of sound signals having 

5 spatial components ; 

determining a current orientation of said 
headphones around said listener; 

determining a mapping function from a series of 
spatially . static virtual speakers placed around the 
10 listener to each ear of the listener; 

utilising said current orientation to determine a 
current panning of said sound signals to said series of 
virtual speakers so as to produce a panned sound input 
signal for each of said virtual speakers; 
15 utilising said mapping function to map said 

panned sound input signal to each ear of said listener; and 

combining said mapped panned sound input signals 
to produce a left and right output signal for said 
headphones . 

20 2. A method as claimed in claim 1 wherein said 

virtual speakers include a set of simulated speakers placed 
at substantially equal angles around said listener. 

3. A method as claimed in claim 1 wherein said 
virtual speakers are substantially in a horizontal plane 

25 around a listener. 

4. A method as claimed in claim 1 wherein said 
virtual speakers are placed so as to fully surround a 
listener in three dimensions. 

5. A method as claimed in claim 1 wherein said 

30 series of sound signals comprise a Dolby DIGITAL encoding 
of a sound environment. 

6. An apparatus for simulating a spatial sound 
environment to a listener over headphones comprising: 

input means for inputting a series of signals 
35 comprising a spatial sound environment; 
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panning means for panning said series of signals 
amongst a predetermined number of virtual output signals to 
produce a plurality of virtual output speakers signals; 

head related transfer function mapping means for 
5 mapping said virtual output speaker signals to left and 
right headphone channel signals; and 

combining means for combining each of said left 
and right headphone channel signals into combined left and 
right headphone signals for playback over said headphones. 
10 7. An apparatus as claimed in claim 6 wherein said 

panning means, said head related transfer function mapping 
means and said combining means are implemented in the form 
of a suitably programmed digital signal processor. 
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